Data-driven Offline Reinforcement Learning for HVAC-systems

نویسندگان

چکیده

This paper presents a novel framework for Offline Reinforcement Learning (RL) with online fine tuning Heating Ventilation and Air-conditioning (HVAC) systems. The method to do pre-training in black box model environment, where the models are built on data acquired under traditional control policy. focuses application of Underfloor (UFH) an air-to-water-based heat pump. However, should also generalize other HVAC applications. Because Black methods used is there little no commissioning time when applying this buildings/simulations beyond one presented study. explores deploys Artificial Neural Network (ANN) based design efficient controllers. Two ANN tested paper; Multilayer Perceptron (MLP) Long Short Term Memory (LSTM) method. It found that LSTM-based reduces prediction error by 45% compared MLP model. Additionally, different network architectures tested. creating new each step, performance can be improved additionally 19%. By using these paper, it shown Multi-Agent RL algorithm deployed without ever performing worse than industrial controller. Furthermore, if building from Building Management System (BMS) available, agent which performs close optimally first day deployment. An optimal policy cost heating 19.4 % simulation paper.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Autonomous HVAC Control, A Reinforcement Learning Approach

Recent high profile developments of autonomous learning thermostats by companies such as Nest Labs and Honeywell have brought to the fore the possibility of ever greater numbers of intelligent devices permeating our homes and working environments into the future. However, the specific learning approaches and methodologies utilised by these devices have never been made public. In fact little inf...

متن کامل

Offline Evaluation of Online Reinforcement Learning Algorithms

In many real-world reinforcement learning problems, we have access to an existing dataset and would like to use it to evaluate various learning approaches. Typically, one would prefer not to deploy a fixed policy, but rather an algorithm that learns to improve its behavior as it gains more experience. Therefore, we seek to evaluate how a proposed algorithm learns in our environment, meaning we ...

متن کامل

Emotion-Driven Reinforcement Learning

Existing computational models of emotion are primarily concerned with creating more realistic agents, with recent efforts looking into matching human data, including qualitative emotional responses and dynamics. In this paper, our work focuses on the functional benefits of emotion in a cognitive system where emotional feedback helps drive reinforcement learning. Our system is an integration of ...

متن کامل

Reinforcement Learning for CPG-Driven Biped Robot

Animal’s rhythmic movements such as locomotion are considered to be controlled by neural circuits called central pattern generators (CPGs). This article presents a reinforcement learning (RL) method for a CPG controller, which is inspired by the control mechanism of animals. Because the CPG controller is an instance of recurrent neural networks, a naive application of RL involves difficulties. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Energy

سال: 2022

ISSN: ['1873-6785', '0360-5442']

DOI: https://doi.org/10.1016/j.energy.2022.125290